Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[bugfix]: Fix null pointer de-reference when parsing statistics from cAdvisor #83

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

enp0s3
Copy link
Member

@enp0s3 enp0s3 commented Nov 21, 2024

Signed-off-by: Igor Bezukh [email protected]

Scenario 1: it can happen that API reports about a Pod, but the pod and its containers are not yet
fully created on the node cgroupfs, therefore there can be containers that are missing in the stats.

Scenario 2: Stats API has pointers to counters. Not all counters had default allocation of zero integer.

@enp0s3 enp0s3 force-pushed the npd-fix-cadvisor-stat branch 3 times, most recently from 29c9986 to 2de2b11 Compare November 21, 2024 14:53
@enp0s3
Copy link
Member Author

enp0s3 commented Nov 22, 2024

/cc @Barakmor1

@openshift-ci openshift-ci bot requested a review from Barakmor1 November 22, 2024 16:29
pkg/wasp/pod-ranker/rank_utils.go Outdated Show resolved Hide resolved
pkg/wasp/pod-ranker/rank_utils.go Outdated Show resolved Hide resolved
Copy link
Member

@Barakmor1 Barakmor1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Left some small comments overall looks great
Thanks for fixing these bugs!

there might be a situation where pod ranking is done on a freshly created
pod whose containers stats are not yet ready in cadvisor.

Signed-off-by: Igor Bezukh <[email protected]>
@enp0s3 enp0s3 force-pushed the npd-fix-cadvisor-stat branch from 2de2b11 to 8ff1f7c Compare January 6, 2025 13:43
Copy link

openshift-ci bot commented Jan 6, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please ask for approval from enp0s3. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

also fixed a bug in swap stats where swap usage stat was
assigned with swap available bytes.

Signed-off-by: Igor Bezukh <[email protected]>
@enp0s3 enp0s3 force-pushed the npd-fix-cadvisor-stat branch from 8ff1f7c to 39b1889 Compare January 6, 2025 13:49
Copy link
Member Author

@enp0s3 enp0s3 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Barakmor1 Hey!
Could you have another look please?

@@ -134,7 +135,23 @@ func exceedMemoryLimits(summary summaryFunc) cmpFunc {
func hasContainerExceedMemoryLimits(pod *v1.Pod, summary stats_collector.PodSummary) bool {
for _, container := range pod.Spec.Containers {
if rQuantity, ok := container.Resources.Limits[v1.ResourceMemory]; ok {
memoryAndSwapSumQuantity := memoryAndSwapUsage(*summary.Containers[container.Name].MemoryWorkingSetBytes, *summary.Containers[container.Name].MemorySwapCurrentBytes)
containerStats, exists := summary.Containers[container.Name]
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Barakmor1 I could avoid code duplication here by appending the regular containers and init containers into a single list, but I prefer to do it in a follow-up, especially that we lack tests to protect us from re-factoring

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants